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Abstract: Long-PCR amplification, clone and primer-walking sequencing methods were employed in determine the 
complete sequence of mitochondrial genome of tokay ( Gekko gecko). The genome is 16 435 bp in size, contains 13 pro- 
tein-coding, 2 ribosomal and 22 transfer RNA genes. The mt genome of Gekko is similar to most of the vertebrates in gene 
components, order, orientation, tRNA structures, low percentage of guanine and high percentage of thymine, and skews of 
base GC and AT. Base A was preferred at third codon positions for protein genes is similar to amphibians and fishes rather 
than amnion vertebrates. The standard stop codes (TAA) present only in three protein genes, less than those of most ver- 
tebrates. Transfer RNA genes range in length from 63 to 76 nt, their planar structure present characteristic clover leaf, ex- 
cept for tRNA-Cys and tRNA-Ser (АСУ) because of lacking the D arm. 
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The animal mitochondrial (mt) gene content is 
highly conserved, it usually encodes 13 proteins, 2 ri- 
bosomal RNAs (rRNAs), and 22 transfer RNAs (tR- 
NAs). The gene order is also highly conserved among 
most vertebrates (Boore, 1999). However, gene loss 
and gene rearrangement have been found in some taxa 
(Joshua et al, 2003). Comparisons of mitochondrial 
systems are useful for modeling genome evolution and 
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phylogenetic inference (Boore, 2004). They include 
gene content and gene arrangement; base composition ; 
modes of replication and transcription; protein, tRNA 
and rRNA gene secondary structures; and genetic 
codon variations. These features are currently more ac- 
cessible for study in the much smaller and simpler mito- 
chondrial genomes (Garesse et al, 1997; Boore, 
2004). During the last ten years, mitochondrial 
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genome sequence and gene arrangement comparisons 
were employed as powerful new tools for resolving an- 
cient phylogenetic relationships (Zardoya & Meyer, 
1998; Janke et al, 2001; Zhou, 2001; Joshua et al, 
2003; Wu et al, 2003; Boore, 2004). 

Geckos and pygopods comprise the Gekkota, one 
of three major lineages of living lizards and the only 
such lineage to be primarily nocturnal. Current esti- 
mates of gekkotan diversity recognize approximately 
1 110 species іп 116 genera (Kluge, 2001; Bauer, 
2002) and many new taxa are described each year. Al- 
though partial sequences of ND1, ND2, CO1, 12S, 
tRNAs, 165 and Cyt b genes, etc. from gekkonid 
lizards have been published (Donnellan et al, 1999; 
Han et al, 2001; Macey et al, 1999; Nogales et al, 
1998; Austin et al, 2004; Zhou, 2001), no complete 
mitochondrial sequence has been reported from any of 
the geckos so far. In this article, the gene sequences 
and organization of mitochondrial genome of tokay 
(Gekko gecko ) are reported. 


1 Materials and Methods 


1.1 Specimen information 

The specimen of G. gecko was obtained in Nan- 
ning, Guangxi Province in China. The fresh organs 
were frozen immediately in liquid nitrogen and stored 
under - 80 €. 

1.2 PCR amplification, cloning and sequencing 

Mitochondrial DNA was extracted from liver tissue 
following the procedure described in Arnason et al 
(1991). 

We obtained ап mtDNA sequence of Gekko within 
№04 using the primer pair (LND4: 5'-CAG TAT ATG 
GGC TTC ATC ТС-3'; HLEU: 5'-TAC ТТТ TAC TTG 
GAT TTG CAC CA-3') designed based on alignment of 
reptile mtDNAs, and a sequence of 12S rRNA frag- 
ment amplified by universal primers L1091 and H1478. 
Two pairs of primers were synthesized for long-PCR am- 
plification based on the above two sequences: C12SL: 
5'-AGG ТСС АСС ТСТ АСС AAA ССА САС ССА 
AGA GAT G-3'; GND4H: 5’-GCG АТС АТС GAT 
TTG AGA ТСТ СТТ TGA CGC AGG C-3'and GND4L: 
5'-АТТ CAC ACA CGA ACC СТС АТС TTA ACA 
CGA ССС С-3'; G12SH: 5'-GCT CCT CTA GGA TGA 
TAT GGA ACA CCG TCA AGT С-3'. Two fragments of 
about 10 kb and 6 kb respectively were obtained using 
these two pairs of primers by long-PCR amplification 
(Expand Long Template PCR System, Roche). 

The above two fragments were cleaved with restric- 
tion enzyme Hind Ш, EcoR І, Pst I and Асс І. 
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Fragments suitable in size were cloned using either the 
TA cloning method utilizing the plasmid vector pUC19, 
or cloning to vector pGEM-3Zf (+) by cloning kit 
(Takara) according to the manufacturer’s protocol. The 
primer walking strategy was used for the fragments 
longer than 1 kb. 

Both cloning and PCR fragments were sequenced 
from both ends with internal primers using ABI PRISM 
310 sequencer (Perkin Elmer). All overlaps between 
the clones were confirmed by direct sequencing of PCR 
products amplified from the purified mtDNA template. 
The whole mitochondrial genome was read at least two 
times. 

1.3 Data analysis 

The gene sequences of Gekko mitochondrial 
genome were identified by sequence comparison with 
published reptile gene sequences using Clustal X 1.8 
(Thompson et al, 1999). This program was also used 
to calculate the following data: percentage of total base 
substitutions; base percentages of noncoding and con- 
trol region (s); transition/transversion . 

Start and stop codons were used to help defining 
the sequences of protein-coding genes. In addition, tR- 
NA genes were folded to verify their secondary struc- 
tures using program RNAdraw and improved manually. 
Gene compactness, percentage of GC base component, 
Base skew overall, and base skew at 3rd codon position 
for protein-coding genes, etc. were also calculated to 
analyze the genome characters. 

Three other mitochondrial genome sequences ге- 
trieved from GenBank were analyzed and calculated al- 
so in this study for comparing the above characters: 
Eumeces egregius (GenBank accession no. AB016606; 
Kumazawa & Nishida, 1999); Iguana iguana (Gen- 
Bank accession no. AJ404872, Janke et al, 2001); 
Dinodon semicarinatus ( GenBank accession по. 
AB008539, Kumazawa et al, 1998). Although the se- 
quences used in this study are retrieved from defined 
taxonomic entities (species/subspecies), for the sake 
of brevity, we refer to these taxa only by their genus 
names in the text. 


2 Results and Analyses 


The complete mtDNA sequence has been deter- 
mined for С. gecko (GenBank accession по. 
AY282753). The mtDNA is 16 435 bp in size, en- 
codes 13 protein genes, 22 tRNA genes, and 2 rRNA 
genes. Both gene content and gene order are typical of 
vertebrates (Saccone et al, 1999). With respects to the 
length of intergenic spacers and overlaps, Gekko has a 
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rather compact genome compared with the other three 
squamates (Tab. 1 and Tab. 2). The overall base 
composition of Gekko mtDNA for the L-strand is: A, 
32.3%; C, 27.6%; G, 13.6%; and T, 26.4%. 
Guanine (G) is the rarest nucleotide and GC content is 
41.2%. All the Gekko mitochondrial protein-coding 
genes start with an ATG codon except ND2 and Cyt b 
which start with ATA, and ND3 and №05 which start 
with GTG, respectively (Tab. 1). Four protein-coding 
genes terminate with TAA, two end with TAG, two end 
with AGA, and each of the other five has an incomplete 





stop codon, a single stop nucleotide T, where the post- 
transcriptional polyadenylation can produce a standard 
TAA stop codon. The direction or the encoding-strand 
selection of the genes is identical to the typical verte- 
brates (Tab. 1). 

The 22 tRNA genes range from 63 to 76 nt in 
length. Most of them could be folded into the canonical 
cloverleaf secondary structure (Fig. 1). The complete 
dihydrouridine arms (D-arms) are lacking in tRNACys 
and tRNASer (АСУ). 

The stem and loop structure of origin of the puta 


Tab. 1 Organization and features of Gekko gecko mitochondrial genome 


Start Stop Size 


1 
codon codon (bp) = Strand 





Gene/Regin Start Stop Spacer( + ) 
position position — Overlap( - ) 
tRNA-Phe 1 73 
12S rRNA 74 1 033 +3 
tRNA-Val 1 037 I 106 
16S rRNA I 107 2 693 
tRNA-Leu(UUR) 2 694 2 769 
М1 2 770 3 735 +6 
{КМА -Пе 3 742 3 813 -1 
tRNA-Gln 3 813 3 885 -1 
tRNA-Met 3 885 3 954 +4 
ND2 3 959 4 996 -2 
tRNA-Trp 4 995 5 066 +5 
tRNA-Ala 5 072 5 140 
tRNA-Asn 5 141 5 213 
OL 5 216 5 247 -4 
tRNA-Cys 5 244 5 306 
tRNA-Tyr 5 307 5 372 +4 
col 5 377 6 921 -7 
tRNA-Ser(UCN) 6 915 6 989 
tRNA-Asp 6 990 7 057 
col 7 058 7 750 -8 
tRNA-Lys 7 743 7 811 +3 
АТРазе 8 7 815 7 974 - 14 
АТРазе 6 7 961 8 647 -1 
coll 8 647 9 430 
tRNA-Gly 9 431 9 499 
ND3 9 500 9 839 
tRNA-Arg 9 840 9 908 
ND4L 9 909 10 205 -7 
ND4 10 199 11 582 +12 
tRNA-His 11 571 11 642 
tRNA-Ser (АСУ) 11 643 11 707 
tRNA-Leu (СОМ) 11 708 11 778 -21 
NDS 11 758 13 567 +4 
ND6 13 564 14 073 
tRNA-Glu 14 074 14 149 - 10 
Cyt b 14 140 15 287 
tRNA-Thr 15 288 15 357 
tRNA-Pro 15 358 15 423 -2 


73 
960 
70 
1587 
76 

ATG TAA 966 321 
72 
73 
70 

1 038 345 
72 
69 
73 
32 
63 
66 

1 545 514 
75 
68 

ATG AGA 693 230 
69 

ATG Т+ + 160 53 

АТС ТАА 687 228 

АТС Т+ + 784 261 
69 

GTG Т+ + 340 113 
69 

АТС ТАА 297 98 

АТС Т+ + 1 384 461 
72 
65 
71 

GTG Т+ + 1810 603 

АТС ТАС 510 169 
76 

1 152 383 
70 
66 


= 


ATA TAG 


сете 
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АТС АСА 
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! H = heavy strand; L= light strand. 
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Tab. 2 Comparisons of mitochondrial genome features in four squamates 
Genome character Gekko Dinodon Eumeces Iguana 
Gene compactness! +41; -78 +97; -33 +172; -24 +56; -22 
GC base component( 96 ) 41.2 45.2 44.2 39.9 
Base skew overall GC= -0.34; GC= -0.40; GC = -0.30; GCz - 0.39; 
AT=0.10 AT=0.18 AT=0.09 AT 20.16 
Base skew at 3rd codon position СС= - 0.52; СС = -0.78; СС= - 0.56; СС = - 0.80; 
АТ=0.22 АТ=0.48 АТ=0.36 АТ=0.57 
lu 4”: Spacer; “-”: Overlap. Control region(s) is excluded. 
Tab. 3 Comparison of organization in noncoding region(s) among Gekko and 3 squamates 
Total NC CR TER CR YCR 
Taxon Reference 
(bp) (%) Lenth Tan-rep Lenth Тап-гер Tan-rep Tan-rep 
(bp  (%) (bp (%) 
Сео 16 435 6.6 1013 36.5 PE — 2х75+22; 2x 54; = This study 
2x9; 2x36 
Dinodon 17 191 12.3 1018 18.4 1055 17.7 2х2; 3х49 2х20; 3х49 Kumazawa et al, 1998 
Eumeces 17 407 11.7 1 863 39.9 245° 98.0 5х 83 +32; 2x56; 2х 120 Kumazawa & Nishida, 1999 
2x71 +43 
Iguana 16 633 7.5 1 191 9.4 I — 2x38; 3x12 = Janke et al, 2001 


NC = Nocoding region; CR = Control region; WCR = Second control or noncoding region. * There is an 83 - bp overlap between the first repeat unit 


(5') and Cyt b gene (3') sequence, and another 30 – bp overlap between the second repeat unit (3') and the tRNAThr gene (5') sequence. 


tive light-strand replication locates between the tRNA- 
Asn and tRNA-Cys. The D-loop (control region) locat- 
ed between tRNAPro and tRNAPhe is 1 013 bp in size. 
3 Discussion 
3.1 Base composition and nucleotide bias 

Vertebrate animal mitochondrial genomes deviate 
from a random usage of nucleotide. Saccone (1999) 
used the formula, base-skew = (А - T/A +T) ог (G- 
C/G + С), to evaluate the degree of the base bias, and 
found all the values of GC-skew were negative while all 
the values of AT-skew were positive in amniotes. The 
base bias of Gekko and the other three squamates (Tab. 
2) accords with the feature, and we can find the base 
bias overall (GC = — 0.34; AT 20.10) of the Gekko is 
most similar to Eumeces (GC = —0.30; AT=0.09). 

The percentage of total base substitutions and gaps 
between Gekko and Dinodon (38.396) is higher than 
that between Gekko and each of the other two lizards 
( Gekko / Eumeces = 33.6 96 ; Gekko / Iguana = 32.8 
96), respectively; the percentage of  transition/ 
transversion between Gekko and Dinodon (73.0%) is 
much lower than that between Gekko and each of the 
other two lizards ( Gekko / Еитесеѕ = 92.2%; Gekko/ 
Iguana = 84.6%). 
3.2 Protein-coding genes 

Base composition of vertebrate protein-coding 
genes is characterized by positive bias for T and nega- 


tive bias for G in the second positions, but the devia- 
tions in the third positions of synonymous codons might 
reflect more of this feature for the light directional muta- 
tion pressure in these positions (Saccone et al, 1999). 
While all the amniotes have a positive bias in the third 
positions for C and A, most of amphibians and fishes 
have only a positive bias for A in this positions (Janke 
et al, 2001), and the base bias of Gekko (А = 38.8; C 
= 27.6; G=8.7; Т=24.9) is similar to the later. 
3.3 Transfer RNAs 

The complete missing of D-arm of tRNASer 
(АСУ) has also been found in the mtDNAs of the black 
spotted frog Rana nigromaculata (Sumida et al, 2001) 
and the African side-necked turtle Pelomedusa subrufa 
(Zardoya & Meyer, 1998). Gekko tRNA anticodon 
triplet sequences are exactly the same as in other verte- 
brates, and therefore they likely use the same genetic 
codons. As in other vertebrates, folding of the tRNA 
sequences required the formation of G + T and other 
atypical pairings (Fig. 1). 
3.4 Noncoding regions 

The stem-and-loop structure of origin for the puta- 
tive light-strand replication of most vertebrates locates 
between tRNAAsn and tRNACys, which are absent in 
the African side-necked turtle ( Pelomedusa subrufa ) , 
Chinese alligator ( Alligator sininsis ), Texas blind 
snake ( Typhlonectes natans ) , tuatara ( Sphenodon pun- 
ctatus )and some birds (Zardoya & Meyer, 1998; Wu 
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et al, 2003; Seutin et al, 1994; Kumazawa & Nishi- 
da, 1995; Desjardins & Morais, 1990), and can be 
easily identified at the corresponding position in 
Gekko . 

The sequence of the control region (CR) located 
between tRNAPro and tRNAPhe is 1 013 bp in size and 
consists of 6.6% of the whole genome. This percent- 
age is slightly lower than that of Eumeces (7.5%) and 
much lower than Dinodon (12.3%). Some structural 
features of the noncoding regions and their tandem- 
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